Information retrieval on Turkish texts
Similar resources
Information retrieval on Turkish texts
We study information retrieval (IR) on Turkish texts using a large-scale test collection that contains 408,305 documents and 72 ad hoc queries. We examine the effects of several stemming options and query-document matching functions on retrieval performance. We show that a simple word truncation approach, a word truncation approach that uses language dependent corpus statistics, and an elaborat...
Effects of diacritics on Turkish information retrieval
We investigate the effects of improper use of diacritics in the Turkish alphabet on information retrieval. A diacritic is simply a supplementary sign added to a letter to change the sound value of the letter, and the Turkish alphabet has 5 special letters derived from Latin by adding different diacritics. The statistical analysis performed in this study shows that retrieval performance signific...
Information Retrieval from Annotated Texts
Methods for the correct and efficient handling of annotations in a full-text retrieval system are investigated. The problem with annotations is that they cannot be treated as regular text, since this would disrupt proximity searches, but on the other hand, they cannot be ignored, as they may carry important information. Moreover, in some cases, a user may wish to restrict a search to prespecified ...
Designing integrated information retrieval system for Farsi texts
The expansion of information, together with the need to use it at a suitable and appropriate time, is one of the important goals in this century, the information century. The user's query, that is, access to the requested text information in a short time, must be satisfied using effective techniques. This can be achieved through text compression and retrieval, which have been treated separately bef...
Information Retrieval Effectiveness of Turkish Search Engines
This is an investigation of information retrieval performance of Turkish search engines with respect to precision, normalized recall, coverage and novelty ratios. We defined seventeen query topics for Arabul, Arama, Netbul and Superonline. These queries were carefully selected to assess the capability of a search engine for handling broad or narrow topic subjects, exclusion of particular inform...
Journal
Journal title: Journal of the American Society for Information Science and Technology
Year: 2008
ISSN: 1532-2882,1532-2890
DOI: 10.1002/asi.20750